High Performance MPI over the Slingshot Interconnect

نویسندگان

چکیده

The Slingshot interconnect designed by HPE/Cray is becoming more relevant in high-performance computing with its deployment on the upcoming exascale systems. In particular, it empowering first and highest-ranked supercomputer world, Frontier. It offers various features such as adaptive routing, congestion control, isolated workloads. of newer interconnects sparks interest related to performance, scalability, any potential bottlenecks they are critical elements contributing scalability across nodes these this paper, we delve into challenges poses current state-of-the-art MPI (message passing interface) libraries. look at performance when using nodes. We present a comprehensive evaluation communication libraries including Cray MPICH, Open- + UCX, RCCL, MVAPICH2 CPUs GPUs Spock system, an early access cluster deployed Slingshot-10, AMD MI100 Epyc Rome emulate Frontier system. also evaluate preliminary CPU-based support Slingshot-11 interconnect.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Ultra-high performance communication with MPI and the Sun fireTM link interconnect

We present a new low-latency system area network that provides the ultra-high bandwidth needed to fuse a collection of large SMP servers into a capability cluster. The network adapter exports a remote shared memory (RSM) model that supports low latency kernel bypass messaging. The SunTM MPI library uses the RSM interface to implement a highly efficient memory-to-memory messaging protocol in whi...

متن کامل

Scalable High Performance Message Passing over InfiniBand for Open MPI

InfiniBand (IB) is a popular network technology for modern high-performance computing systems. MPI implementations traditionally support IB using a reliable, connection-oriented (RC) transport. However, per-process resource usage that grows linearly with the number of processes, makes this approach prohibitive for large-scale systems. IB provides an alternative in the form of a connectionless u...

متن کامل

High Performance Broadcast Support in La-Mpi Over Quadrics

LA-MPI is a unique MPI implementation that provides network-level fault-tolerant message passing. This paper describes the efficient implementation of a scalable MPI broadcast algorithm. LA-MPI implements a generic version of the broadcast algorithm using a spanning tree method built on top of point-to-point messaging. However, the Quadrics network, with it’s hardware broadcast support, provide...

متن کامل

Open MPI: A Flexible High Performance MPI

A large number of MPI implementations are currently available, each of which emphasize different aspects of high-performance computing or are intended to solve a specific research problem. The result is a myriad of incompatible MPI implementations, all of which require separate installation, and the combination of which present significant logistical challenges for end users. Building upon prio...

متن کامل

Analyzing MPI performance over 10-Gigabit ethernet

Recent work with 10-Gigabit (10GbE) network adapters has demonstrated good performance in TCP/IP-based localand wide-area networks (LANs and WANs). In the present work we present an evaluation of host-based 10GbE adapters in a system-area network (SAN) in support of a cluster. This evaluation focuses on the performance of the message-passing interface (MPI) when running over a 10GbE interconnec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Computer Science and Technology

سال: 2023

ISSN: ['1666-6046', '1666-6038']

DOI: https://doi.org/10.1007/s11390-023-2907-5